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Motivated by multiple statistical hypothesis testing, we obtain 
the limit of likelihood ratio of large deviations for self-normalized 
random variables, specifically, the ratio of P(y/n(X + d/n) > x„V) 
. to P(^/nX > x„V), asn-> oo, where X and V are the sample mean 

and standard deviation of iid Xi, . . . ,X n , respectively, d > is a 
constant and x n —> oo. We show that the limit can have a simple 
form e d//z °, where zq is the unique maximizer of zf(x) with / the 
density of Xi. The result is applied to derive the minimum sample 
size per test in order to control the error rate of multiple testing at 
a target level, when real signals are different from noise signals only 
by a small shift. 



k*" 1. Introduction. 

O 

I/"") ■ 1.1. Background. Suppose X\, X2, ■ ■ ■ are iid random variables with den- 
sity /, such that P(X 1 > 0) > 0. For n > 1, let S n = IH h X n . We 

O ' shall consider the biased t statistic 



v^DT - S n 

T n = —r^, with X = — , V 
V n 



1/2 



^ I The choice for T n is only for simplicity of notation. All the results obtained 

for T n in the paper hold for the standard t statistic \Jn — IX /V as well. 

The aim here is to find the limit of the ratio of tail probabilities for T n , 
specifically, the limit of 

P (v^(X + d/n) > x n V) 

— 1 ; — r -, as n — > OO, 

P {^ax > x n V) 

where d > is a constant and x n — > 00 in a suitable rate. The problem 
pertains to large deviations for self-normalized random variables [5l[9]. On 
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the other hand, it is directly related to statistical multiple hypothesis testing, 
in particular, the False Discovery Rate (FDR) control [I], which in recent 
years has generated intensive research due to its applications in microarray 
data analysis, medical imagery, etc, where a very large number of signals 
("null hypotheses") have to be sorted through in order to identify signals of 
interest ("false nulls") from the other, noise signals ("true nulls") [6l[7tl8|[l0]. 

A measure of performance for multiple testing is the fraction of falsely 
identified noise signals ("false discoveries") among the identified ones. Given 
that at least one signal is identified, the fraction is a well-defined random 
variable and its conditional expectation is called positive FDR, or pFDR. 
For a testing procedure, it is desirable that, given a target control level a, 
the procedure attains pFDR < a. However, whether or not this is possible 
depends on the property of the data distributions as well as how much data 
is available to assess the hypotheses. We consider a typical multiple testing 
problem, where the data distributions are shifted and scaled versions of each 
other. 

Suppose the data distributions are Fi(x) = F(siX — iij), where F is a 
fixed distribution, and Sj > and Ui are unknown. In order to identify from 
Fi those with Ui 7^ 0, we test null (hypotheses) Hi : Ui = to see which 
one can be rejected. To this end, let n iid observations be sampled from Fi, 
which can be written as Yn = (Xn +Ui)/si, . . . , Yj n = (Xi n + Ui)/,Si, with 
Xij ~ F. Suppose the nulls are tested independently of each other, so that 
Xij are iid for i > 1, j = 1, . . . , n. Typically, Hi is rejected if and only if the 
t statistic of Ya, . . . , Yi n is larger than a cut-off value x n . Suppose that false 
nulls occur randomly in the population of nulls, such that each Hi can be 
false with probability p G (0, 1) independently of the others, and Ui = u > 
when Hi is false. By definition, a falsely rejected null is a true null, i.e., 
Ui = 0. It is then not hard to see 

1 — p 

(1.1) P '(Hi is falsely rejected | Hi is rejected) = — , 

1 P ~\~ pRn 

where R n is the ratio of tail probabilities 

_ p(VE(x + u)>x n y) 
Kn[u) ~ p{^x > Xn v) ■ 

It follows that the minimum attainable pFDR is equal to the right hand 
side of (jl.ip as well [I] . Consequently, if real signals are weak in the sense 
that u ~ 0, then R n can be close to 1, implying that when a nonempty set 
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of nulls are rejected by whatever multiple testing procedure, it is likely that 
most or almost all of them are falsely rejected. 

For the t test, the only way to address the above limitation on the error 
rate control is to increase n, the number of observations for each null. From 
(jl.ip . in order to attain pFDR < a, n must satisfy 

(1.2) R n (u) > (l/p-l)(l/a-l). 

An important question is, as u ~ 0, what would be the minimum n in order 
for (TO]) to hold. 

The issue of sample size for pFDR control was previously studied in [3]. 
However, in that work the t statistic was defined in a different way, with X 
and V derived from two independent samples instead of from the same sam- 
ple. Although that definition allows an easier treatment, it is not commonly 
used in practice. Furthermore, the asymptotic result in [3] is different from 
the one reported here for the more commonly used t statistic. 

1.2. Main results. We need to be more specific about the cut-off value 
x n . Usually, as n increases, one can afford to look at more extreme tails to 
get stronger evidence against nulls. This suggests there should be x n — > oo 
as n — ► oo. If EX > and EX 2 < oo for X ~ F, then x n should be at 
least of the same order as y/n, otherwise inf pFDR — > 1, where the infimum is 
taken over all possible multiple testing procedures that are solely based on Tj. 
Furthermore, for F = N(0, 1), it is known that there should be x n j ' \fn — > oo 
in order to attain inf pFDR [3] . Based on the considerations, for the general 
case, we will impose x n = a n y/n with a n — > oo as the cut-off value. 

Theorem 1.1. Suppose the density f satisfies the following conditions. 

1) f is bounded and continuous on R and there is 7 > 0, such that 

lim x l+1 fix) < 00. 

2) zf{z) has a unique maximizer zq > 0. 

3) h := log/ is three times differentiable on R, such that sup|/i"| < 00 
and sup | h'" | < 00 . 

Let a n — > 00, such that = oinj log n). Then for any d n — ► d £ (0,oo) ; 

P (X + d n /n > a n V) d/ 

— y-= r '- — >e'°, as n — > 00. 

P (X > a n V) 

Note that for different n, X and V are different random variables. 
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Let fc* = fc* (it) be the minimum n in order for (jl.2p to hold. The asymp- 
totic of k* as u — > is a consequence of Theorem 11.11 



Corollary 1.1. Suppose f and a n satisfy the conditions in Theorem 
[PI Let p G (0, 1) and a £ (0, 1) be fixed in (fL2|) . T/ien 

fc*(u) ~ (^ /«)ln[(l/p- l)(l/a- 1)], as u^0 + . 

Many probability densities satisfy conditions l)-3) of Theorem ll.il for ex- 
ample, Gaussian density fx(x;fj,,a) = e~^ x ~^ '' l 2a ' 1 / '\p2mo~ and Cauchy den- 
sity f2{x; fJ,, a) = <T7r _1 [(T 2 -|-(x — fi) 2 }^ 1 . In particular, when fj, = and a = 1, 
both have zo = 1. Therefore, even though all the moments of f± are finite 
whereas all those of fi are infinite, in terms of the amount of data needed 
to control the pFDR, these two are asymptotically the same. On the other 
hand, Theorem 11.11 is not applicable to densities with zeros on M. Since the 
conclusion of Theorem 1 1 . 1 1 has nothing to do with the continuity of h = log / 
over R, it is desirable to remove condition 3) altogether. 

In the rest of the paper, Section 2 proves Theorem 1 1 . 1 1 and Corollarv ll.il 
Sections 3 and 4 contain proofs of lemmas for the main results. 

2. Proof of main results. A key to the proof is the fact that the 
analysis can be localized at z$, which is revealed by a representation of the 
event {T n > ^/na n } given by Shao [9]. It is easily seen that for t > 0, 



n + t 2 



1/2' 



where Q n = X 2 + • • • + X 2 , 
(cf. [9]). lit = \Jria n , then, letting r = 1 — (1 + a~ 2 ) -1 / 2 and following [9], 

{T n > Vna n } = 




> 



Let z = b/(l — r) and o~ n = \/r(2 — r). Then 
(2.1) {T n > y^a n } = U 2 n > mf-E 

I i=l 



Xi 

z 
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Under the assumption of Theorem ll.il r = 1 — (1 + a" 2 ) -1 / 2 = a~ 2 /2 + 
o(a~ 2 ), and hence <7 2 ~ 2r = a~ 2 + o(a~ 2 ), yielding 

(2.2) cr n — > 0, na\j log n ~ n/(a n log n) — >■ oo. 
Equations (|2.ip and (|2.2p are the starting point of the proof. 

Lemma 2.1. Suppose f satisfies condition 1) and 2) in Theorem \l.l[ Let 
a n — > such that na\j logn — > oo. Then, given r > 0, i/iere is 5 = 5(r) > 0, 
swc/i that 

hm sup t = 1. 

~""*pU> w iff— -A) 

The lemma will be proved later. The following heuristic explains why 
the analysis can be localized at zq. Let d = 0. For <r 2 <C 1, if the event 
Ez = Wn > Sr=i(^i/ 2: ~ I) 2 } occurs, then most of Xi must fall 

between (1 — o~ n )z and (1 + cr n )z, implying 

log P(E Z ) » nlogP(|X - *| < <r n z) « n log(2a n zf(z)). 

As a result, given that at least one E z occurs, the most likely value of z 
should be the maximizer of zf(z), i.e., zq. 

The following fact will be used in the proof of Theorem ll.il If X±, . . . , X n 
are iid with density / and n > 3, then the joint density of X and V is 

„ n 

(2.3) h(t, s) = (V^) n s n - 2 I J] f(t + V^swO^W 

where is the uniform distribution on a (n — 2) dimensional unit sphere 
perpendicular to (1,1,..., 1) in M™, i.e., 

{n n ~\ 

a, G M™ : ]>> 2 = 1, ]>>i = . 
i=l i=l ) 

For completeness, a sketch of the proof of (|2.3p is given in the Appendix. 
Finally, recall that for any a£l and random variables £i , . . . , £ n , 



-Ete-a) 2 = (^-a) 2 + ^ 2 



n t , 



where £ is the sample mean of £j, and Vg = n 1 / 2 '\JY^i=i{£,i ~ 2 ^ s the 
biased sample standard deviation. 
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Proof of Theorem 11.11 Fix d n > such that d n — > d < oo. Given 
r > 0, for n » 1, \d n /n\ < 5, where 5 = 5(r) > is as in Lemma l2.1i It 
therefore suffices to consider the limit of 



L n :- 



p\a 2 n > inf iff— "I 



p{a 2 n > inf -J2 



— - 1 

z 



where Xi :Tl := Xi + d n /n has density f(x — d n /n). Let 

f. o . (t-z) 2 + S 2 

r n = < (i, s) G (—00,00) x I0,oo) : <r n > inf 5 

^ |z-zo|<r 2^ 

Then for any random variables £1, . . . , £ n , 



-z \<r n ~[\ z /J 



\z- 

Apply the above formula to Xi >n and Xi respectively. By (12 . 1 [> and (12. 3ft . 

„ n 

/ s n_2 /(t — <i n /n + v^swj) /x n (da;) cit ds 

J(t,s,u)€V n xUn i= i 



„ n 

/ sn ~ 2 II /(* + V™ S0J i) Hn{du)) dt ds 

J (t,S,U))£rr,.XUr,. .-1 



'(t,s,cj)er n xc/ n i=1 

p(t, s, uj)u{dt, ds, duj), 

i{t,s,uj)£r n xu„ 

where u(dt, ds, duo) is the probability measure on T n x U n proportional to 
g n -2 rj™ =1 f(t 4- ^/nswj) n n {duj) dtds, and 

For each (t, s,u) £ T n x U n , by Taylor expansion, 

s, w) = exp [M* + \fnsuJi — d n /n) — h(t + y/nsuji)] j 

= exp j--^ 1 X] h'(t + A/nswj) + e n | 

where sup( t (Sjt A \e n \ = 0{d^ l /n) = 0(l/n) due to sup x \h"(x)\ < 00. By 
Taylor expansion and uj\ + • • • + uj Tl = 0, 

n 1 n 

— h'(t + y/nsuji) = h'(t) H — h"'(t + 6y/nsuJi)(-\/nsuji) 2 
n r— f n r— f 
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for some 6 £ (0, 1). Because uJ^ cidd up to 1 ctnd. (£, 5) G T n , 



1 n 



~y^h'(t + y/nsuJi) - h'(t) 



n . 
i=i 



< suv\ti"{x)\s 2 < Aa. 



where A = (zq + r) 2 sup a , \h"'(x)\ < oo. For (t, s) G T n , as \t — z\ < a n z for 
some z £ [zq — r, Zo + r], \t — zq\ < r + a n (zQ + r) < 2r for n 3> 1. Combining 
the above bounds, 

e -d„A(2r)- J 4a ? 2 l -sup|e„| < P{% S ' U ) < e d n A(2r)+Aal+sup\e n \ 
— e —d„h'(zo) — ' 

with A(c) — sup| z _2 |< c — /i'(zo)|. Since r is arbitrary and h! is contin- 
uous, from the expression of L n and d n — > d, it is seen that L n ~ e -dh'(z ) 
as n — > oo. Finally, since zo maximizes logz + ^t(^), h'(zo) = —l/z$. So 
L n ~ e d / Z °. □ 

Proof of Corollary 11.11 First, it is necessary to show that asm 
0+, A;* (it) — > oo. To this end, it suffices to show that, when n and c > are 
fixed, then 

P(X + u > cV) 
^■■= P(X>~cV) U l » aS ^° + ' 

where X and V are defined in terms of X±, . . . , X n . The limit follows from 
a corollary to Fatou's lemma, which states that if l n (x) < f n (x) < it n (x), 
^n(^) — ► K 37 )) /nO^) — * an d t* n (x) — ► it(x) pointwise as n — > oo, and 
J l n —> j I and J u n — > / it, then / /„ — > / / . Specifically, let 



> inf ly f^l-iVl for r >0. 



yl(r) = |(xi, . . . ,x n ) : r 2 
Then by (|2.ip . there is cr G (0, 1), such that 

/n 
1 {x G A(cr)} /(xi - «) dxi ■ ■ ■ dx r 
i=i 

„ n 

P(X > cV) = J 1 {x G A((7)} n /(*,) dxi • • • dx n . 



1=1 



Apparently, < 1 {x G A(<r)} EKLi - «) < I1F=1 /O* ~ u )> with the 
right hand side having the same integral as Ylf = i /(xj). Since f(x — it) — > 
f(x) pointwise as it — > 0, the above corollary to Fatou's lemma implies 
P(X + u>cV)^ P(X > cV) > 0. Then £(u) -» 1. 



8 



Z. CHI 



Next, we show that uk*{u) is bounded from oo as u — > 0+. Suppose that 
there is a sequence it, such that Uik*(ui) — > oo. Clearly, rtj := k*(ui) — > oo. 
Then, given any M, Ujnj > M for i ^> 1 and hence by Theorem 11.11 

P(X + n^ > a ni V) > P(I + M/nj > a n ,y) 
P(X > a n Y) ~ P(X > a n V) 

_>e M /*»(l/p-l)(l/a-l), 

which contradicts the definition of /c*(uj). 

It only remains to show that uk*(u) — ► do := Zoln[(l/p — l)(l/a — 1)] 
as u — > 0. It suffices to show that for any sequence Ui — > with convergent 
Uik*(v,i), the limit oluik*{ui) is do- Indeed, let the limit be d. Then, following 
the above argument, e d / z ° = (1/p — l)(l/a — 1), giving d = do- □ 

3. Proof of Lemma 12.11 



Lemma 3.1. Let a G (0, 1), rj > and s > 0. Then 



1 

mf Ay 

s<2<(i+r)c)s n r-f 

2 



2 



1 <C7 2 



1 



n 



i=l 



c <jly <(l + 7? a) 2 (l + ?? )^ 



2^2 



PROOF. Suppose (1/n) X^LiP^A ~ !) 2 - 0-2 for some z £ [«, (1 + ¥ 7 ) s ]- 
Then |X/z - 1| < cr. By < 1 - s/z < r\a and z 2 /s 2 < (1 + r?cr) 2 , 



1 E / 



■;?. 



i=l 



1 



1 



1 



£(*-*) 2 + T" 1 



8=1 



z 2 



1 



y(x 4 -x) 2 + 



i=i 

2 



2 £ 



Z 



1-- + --1 



□ 



with the last expression no greater than (1 + r]a) 2 (l + r\) 2 a 2 . 

In the next, let X\, X%, ... be iid random variables with density /. 

Lemma 3.2. Suppose linx^oo x l+1 f(x) < oo for some 7 > 0. Let a n — > 
such that lim „ n<r n > 0. Then, given T > and 5 > 0, there is a = a(T, 5) > 
0, such that for n ^ 1, 



if 1 n 

sup — log P I cr 2 > inf — y 

\d\<S n { z ^ an f^i 



X l + d 



1 <io g( j n -r. 
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Proof. We first show that there is a = a(T) > 0, such that 



(3.1) 



1 



1 A (X, 



-logP \al> inf-£ — -1 \<\oga n -T. 
n z>a n ~ \ z ) 



Fix 77 G (0, 1) with 77 > (1 + ??/8) 2 (l + r?/4) 2 - 1. Let a n = l + rja n /4. For 
n > 1 with <7 n < 1/2, a n < 1 + 77/8, so by Lemma l3~Tj for any a > 0, 



i=l 



(3.2) 



<E P \a 2 n > inf . ±2 — 

j=0 { aai<z<aa J n +1 Tl i=1 \ Z 

00 r -. n , Y 



3=0 



aot n 



Let s = (1 + i])cr 2 . By Chernoff's inequality, for z > and i > 0, 



1 



(3.3) P 5 >-£ 



1=1 



Xi 



1 < 



* / e ts ^ tu f(z + zu)du 



Fix > lim^^oo x 1+1 f(x). Let M(z) = (7/2) log z and f = M(z) /s. Then 



2 / e 



ts—tu z 



f(z + zu) du < 



A 



27(1 - ^1+7 



+ 2 



|lt|>7/ 



,M(z)-M(z)r,ysf( zu + ^ du 



< 



M(z)-M(z)r; 2 /s 



ZT(1 - 7/) 1+ T y M(z) v - 



/2 



Since e M W = z^l 2 and ^ = y/{\ + 77)^, for z > 1, A < ,4z-T/ 2 <7 n /2. 
On the other hand, z> 1 and a n <C 1, the following (in) equalities hold 

„ 2 

7 2 = z 7(l-W*)/2 < z -7/2 z < ^-7/2^/2, 

so Ji + J 2 < Az^ 2 a n . Then by (£L2])and (1331) . 



£>a 71 r . 



Xi 



1 < 



00 

E [^(«« J n)" 



7/2 



On 



j=0 



(Aa~^ 2 a n r 



1 - (1 + 7?a n /4)-W2 
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Since a n — > and lim „ na n > 0, there is K > such that for all n> 1, 
1 - (1 + 7]a n /4,yy n / 2 > 1 - e -^ nCT "/ 9 > 1/K. Thus 

1 log P (o* > inf " E ( - ~ l) 2 ) < log *n + hg(Aa~^) + 1 



n z>a n ., \ z I n 

Since A and K are fixed independently of a, by choosing a = a(T) large 
enough, (|3.ip is proved. 

Finally, for d 6 [—5, 5] and z > a, 

X t + d y _(z-d) 2 ( X t \ 2 > (a-5) 2 / Xj ^ 2 



z J z 2 \z — d J a 2 \z — d 

Therefore, 



i 2 ' 
1 ' 



1 ^ pJ 2 ^ ■ f If f jW 
sup — log P < a > ml — > 

| d |< 5 rj [ ^ on ^V z 
<-logi^- ^> inf -V — -1 L 

Then the lemma follows from (13.11). □ 



Lemma 3.3. Suppose / is bounded. Let a n — > such that lim n na n > 0. 
Then, given T > 0, there is b = b(T) > 0, such that for n S> 1, 



sup 1 log P <k > inf I £ ( ^ " l) 2 ) < log *n - T. 

deR n o<z<b n f-f V 2 / 



n / v i J \ 2' 

8=1 

\2/i i „/A\2 



Proof. Given rj > such that 77 > (1 + r//8) 2 (l + r//4) 2 — 1, by the same 
argument for (|3.2p . for 6 > 0, d G E and n > 1 with cr n < 1/2, letting 
a n = 1 + 770^/4, 

P { bc-^KzKba-' Z 

<£p(( 1 + ^>itf^^-i N2 ' 

j=0 [ i=l \ / 

Denote A = sup /. For any s > 0, z > and rfsR, 

/ e-Wt-W'fix -d)dx<A I e -^/^?ls dx = Az^Ts. 
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Since the density of Xi + d is f{x — d), by Chernoff's inequality, 

I ^ 6 n i=l V z / J i=0 



1 - (1 + 7/a n /4)-" ' 
By the same argument for Lemma 13.21 the lemma is then proved. □ 

Lemma 3.4. Let < b < a < oo and suppose / is continuous and 
nonzero in a neighborhood of [b, a]. If a n — > 0, then, given r/ > 0, for n> 1, 

1 f 1 n /X \ 2 1 

logP ^ > - £ ( — - 1 < log<x n + log[VfcFi*/(*)] + r?, 



n n f— f v z 

\ i=i 



holds for all z G [6, a] . 

Proof. Fix c > such that 21og(l + c) < rj. Because / is continuous and 
positive in a neighborhood of [b, a], there is s > 0, such that for all z £ [b, a] 
and u € [— s, s], f(z + uz) < (1 + c)f(z). Then 

'l 1 /x x2 " 



!(*):= /exp ---^ --1 \f{x)dx 



z\/e / e 



2 2al 

„2 //o„2 



2 



« /( 2CT n)/(z + nz)dn 



< «Ve(l + c )/(^) / e~ u /(2<jJ du + ze (1 ~ s /<T « )/2 / /(* + uz) du 

J-s J\ u \> s 

< V2^a n (l + c)zf(z) + e ^- s2 /^l 2 . 

By inf ze [ ba ] zf(z) > and a n — > 0, it follows that for n 3> 1, < 
\/27re(T n ,(l + c) 2 zf(z) < \/2irea n e v zf(z). Together with Chernoff's inequal- 
ity, this implies the inequality in the lemma. □ 

To demonstrate Lemma 12.11 we need the following application of the uni- 
form exact LDP of [2]. The result will be proved in the next section. 

Proposition 3.1. Suppose / is bounded on R. Let z > such that / 
is continuous and nonzero at z. Define 



(3.4) h(t) = log 



z / e~ tu f(z + uz) du 



t > 0. 



12 Z. CHI 

Let a n — > such that nu^/logn — ► oo. Then, for each n, there is a unique 
t n > 0, such that h'(t n ) = — cr^, and moreover, as n — > oo, 

(3.5) t n ~ * 



(3.6) fc(t n ) = log a n + log[V27r 2/(0)] + o(l) 

(3.7) p k > - 1 f- - i) 2 l ~ exp{n(a ^ +Mt " ))} 



V J = l ) v 

Proof of Lemma 12.11 It suffices to show that there is 5 = S(r) > 0, 
such that 

[ 2>0, \z-z \>r n ~[\ z 1 

lim sup -, = 0. 

[ n |«- Zo |<r-n^ V ^ 
Denote the denominator by -B(r, d). Given < rj -C 1, when \d\ <C min(r, zo), 

Therefore, it is enough to show that there is 5 > 0, such that 
sup P U n > mf 



|d|<5 { z>0,\z-z \>r n . =1 

1 A /X; 



(3-8) Jim Lir ^ ^ = 0. 



By the assumption of the lemma 



D := log[z f{z )] - sup log[z/(z)] > 0. 

z>0, |2-z |>r/2 

By Proposition 13.11 as long as ij > is small enough, as n — > 0, 

X: ^ 



-iogp\(i- v )a 2 n >-J2 — -1 



= (1 - r/)^ + io g (vT^o-„) + iog[v^F + 0(1) 

(3.9) > M n := \oga n + log[v^rlzo/(zo)] - D/A. 
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Since a n — ► and ncr 4 / log n — ► oo, n<7 n — ► oo as well. By Lemmas 13.21 - 
13.31 there are 6 G (0, zo — a G (^o + r > oo) and <5o > 0, such that 

(3.10) sup -logp(^> inf l£(Xi±*-l) 2 \<M n -D/2, 

Fix < <5 < <5 such that 5 < min(r/2, 6/2, a) and < (1 + 7?)(z - <5) 2 for 
all z G [6, a]. Then 

su P p{a 2 > inf -Vf^-lVi 

\d\<S { b<z<a,\z-z \>r Tl \ Z J ~\Z - d J J 

<p{ sup f^W*> inf 1 Ef--lV) 

U<*<a,|d|<* V^- a/ 6-<5<z<a+<5, \z-z \>r/2 n \ * / J 

^{(^^^r^Kf- 1 ) 2 }' 

where J = [6/2, zo — r/2] U [zq + r/2, 2a]. By Lemma 13.41 and the definition 
of D, as long as rj is chosen small enough, for all n ^ 1, 

sup - log P { (1 + rjf<% > 1 f; - l) 2 ) < M n - D/2. 

Let a n = 1 + r/cr n and iV(n) = [log (4a/ 6) /log a n ] . It is not hard to see 
that J can be covered by the union of at most N(n) intervals of the form 
If. = [xfc,a n Xfe]. By Lemma |3. II and the above inequality, for n S> 1, 

<sp{ (1+ , )4 >infi|:(f-i) 2 } 

<„(„)_p{( 1 + , M >Ig(^-l) 2 } 

and hence 

sup - log P < cr„ > mf - > 1 > 

| d |< 5 n [ 6<z<a,|z-2 |>r n ^ V z J J 



As n — > oo, iV(n) ~ log(4a/6)/(r?tr n ) = C^cr^ 1 ). Since na^/logn — > oo, 
log N{n)/n -> 0. By combining (gTS} - (j3TTT|L (133]) is thus proved. □ 
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4. Proof of Proposition f3. 11 Given z > 0, the log-moment generating 
function of — (X/z — l) 2 is h(t), which is defined in (|3.4p . It is not hard to 
see that for t > 0, h(t) < oo and 

(4.1) h'(t) = J u2 ^ 2 f( z + uz ) du < 0; 

Je tu2 f(z + uz)du 



Ju A e~ tu f(z + uz)du 
J e~ tu2 f(z + uz) du 



(4.2) fc*(t) = J :\ l2 r - W)? > o 



Lemma 4.1. Fix z > 0. Suppose / is continuous and nonzero at z. Also 
suppose sup / < oo. Then for p > —1, 

|u| p e-* tt3 /(» + «2)<i«~/(2)r(p')r l/ , as t^oo, 

withy = (p + l)/2. 

Proof. Given c > 1, there is 77 > 0, such that f(z)/c < f(z+uz) < cf(z) 
for all u € [—Tj,rj]. Write 



\u\ p e- tu2 f(z + uz) du = T + I =h + h- 



JJ J\u\>T) 



By the selection of c 
/(*) 



c 

As i — > 00, 



" |u| p e-*" 2 dn ~ / \u\ p e- tu2 du = T(p')t- p ', 

-rj J 

h < sup/ / |n| p e-* u2 dn = o(t- p '). 

J\u\>T) 

Since c > 1 is arbitrary, the lemma is proved. □ 

Proof of Proposition 13. H Because h" > on (0, 00), h' is strictly 
increasing on (0, 00). By Lemma 14.11 h'(t) ~ — (2i) _1 as t — > 00. Thus, by 
a n — ► 0, for n S> 1, there is a unique t n — > 00 with ct 2 = —h'(t n ) ~ (2t„) _1 . 
This proves (|3.5p . By Lemma |4.1|, 



h(i n ) = log 



z / e" t?l " f(z + uz)du 



\og[(l + o{l))zf{z)^/t^ 



Together with (|3.5p . this implies (|3.6p . 



LIKELIHOOD RATIO OF SELF-NORMALIZED LDF 



15 



It remains to show (13. T|) . For large n, t n is well-defined. Because a^t + h(t) 
is strictly convex, t n = arg inf t>o [cr^t + h(t)]. Let 



fn(x) 



-t n {x/z-l) 2 ~h(t n ) 



It is seen that /„ is a probability density. Let = —(Cnk/z — l) 2 , where 
Cnfc are iid with density f n . Then by (|4.ip 

E(U) = ~J (X/Z - l) 2 f n (x) = - Z J U^-^-^fiz + ZU) du = ti (t n ) 

and likewise by (|4.2|) . Var(£ nfc ) = h"(t n ). Define 
£nl + nh'(t n ) 



(4.3) 



n / y \ 2 

E(--l) , G„(t) = £[e<H t>0 
i=i v z y 



and A n (i) = log G n (t). By checking the characteristic function of £ n i + . . . + 
£ nn , it can be seen that Y n also has the representation 

(4.4) Y n ~ Tn ~ f; (tw) , with P(T n E dx) = e ^- A ^)p(T n G dz), 

and hence characteristic function 

itA' n (t n ) 1 „ / it 



(4.5) £[e** y ' 1 ] = exp 



G n (t n ) 



Since A n (i) = nh(t), then A' n (i n ) = —na\ and, by Lemma I4TT1 and (|3.5p . 
(4.6) A''(t n ) = nh'n(t n ) ~ «/(2^) ~ 2ncr,^, as n -> oo. 

By standard exponential tilting, 

■=P{Tn> -no*} 
l{Y n > 0} e-tny/KMYn 



n \ z 
i=i 



e ncrlt n +A„(t n ) £ 



Therefore, in order to show (|3.7p . it suffices to show 



(4.7) 



E^l{Y n > 0} e 



t n y/2TTA»{t n ) 

The proof is based on the next lemma, which is essentially established in [2] . 
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Lemma 4.2. For each n, let T n be a random variable such that G n {t) = 
E[e tTn ] < oo in a neighborhood of t n £ R. Let A n (t) = logG n (i) and Y" n be 
defined as in (|4.4|) . Suppose that, as n — > oo, 

(4.8) A£(t n ) oo, *X(*n) oo, 

(4.9) Y n SN(0,l), 

and there is 5 > and uq > 1, such that 

(4.10) /* (*) := sup |i?[ e l ' y "]l < 5v^V^)}| e L 1 , 



(4.11) 

Then dO) holds 



sup 

<5<M<At„ 



n>no 

G„,(t„ + iy) 



G n (t n ) 



1 



,VA > 0. 



Proof. Let /3 n = <VA^(t n ) and 6 n = t n ^A^(t n ). Then by (|Q]> . the 
characteristic function of Y n satisfies conditions (2.7) and (2.8) of Theorem 
2.3 in [2], and hence (2.9) and (2.10) there. Then by Y n -» iV(0, 1) and 
Theorem 2.7 in [2], ([47)) follows. □ 



Continuing the proof of Proposition 13.11 it suffices to verify (14, 8p - (14. lip 
for T n defined in (|4,3p . By (|4.6p and the assumption that na n / logn — > oo, 
(|4.8p is clear. To show (|4.9p . consider the representation in (|4.3p . Because 
EY n = and Var(l^) = 1, we only need to check the Lindeberg condition, 
i.e., for any a > 0, 



nE 





£n ~ h'{t n ) 


> a j 


\ Vnh"(t n ) J \ 


^nh"{t n ) 





0, With £ n ~ 5, nk . 



Since h'(t n ) = a n and h"(t n ) ~ 2a n , for n> 1, |£„ - a%\ > a^Jnh"(t n ) 
implies |£ n | > a\fna\ and |£ n — < 2|£ n |. It thus suffices to show 



(4.12) 



E 



{\Cn\ > av 7 ^^}] = o(a n ). 



By the definition of f n , the expectation on the left hand side is equal to 



-h(t n ) 

-h(t n ) 



i - - l) e- tn{x/z - l)2 f{x)dx 

l{x/z-l) 2 >a^n~cjl \Z 



'I u A e~ tnU f{z + zu)du < {zsupf)e~ h{tn) I n , 
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where, by change of variable u = x/\/tn~ and <r^t n ~ 1/2, 

I n := f n 4 e-*"" 2 du < t~ 5 / 2 f x A e~ x2 dx, for n > 1, 

with b 6 (0,a/2) a constant. Since 

/ x 4 e~ x2 dx = r y 3 / 2 e~ y dy ~ (6v^) 3/ V fev/ " = o(l), 

In = o(tn 5 ^ 2 ) = o(a^). On the other hand, by Lemma B~Tl e~ h( - tn) ~ 
l/(z/(z)v / 2^CT n ) = 0{<j- 1 ). As a result, e - h ^I n = o(a n ), yielding <p02|> . 
To show P~TU|) and (j^TT]) . notice that 

G n (t n + iy) 

where <f> n {y) = ze~ h{tn) J e~ tnu2 ~ iyu2 f (z + zu) du. 



Fix < c <C 1. Since / is continuous and nonzero at z, there is r £ (0, 1/2) 
such that f(z)/(l + c) < /(z + uz) and 1 — u 2 < cos u < 1 — u 2 /(2 + c) for 
u 6 [— r, r]. Write 

n (y) = ae"^) / +ze -W») f = I n (y) + J n (y). 

J\u\<yjr/y J\ u \>\/ r /y 

Then for n S> 1 and j/€S, 

cos(yu 2 )e- tnu2 f(z + zu) du 

-\/r/v 



~ p —h(t n ) ry/r/y 

< 1 - — / y 2 u 4 e- tnU f(z + zu) du 



2 -\- C J —^Jr/y 

-h(t n ) 2 ,v/(f vl ) „ , 2 

<l-f^ *L / u 4 e~ tnU f(z + zu)du 

2 + C J-y/r/Wl) 
ze -Ht n ) y 2 f{z) r^J^T) 4 



(i+c)(2 + c ) J-^jm~) 

2(1 + 2c) y-v^T^Ti) 



where the last inequality is due to change of variable, (13.51) and (13. 6|) . Since 
t n — ► oo, by choosing M S> 1/r, for n 3> 1 and \y\ < (r/M)t n , 

3l/ 2 fT 4 

|Re '» fe ) |sl -2(TTt)- 
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On the other hand, by Lemma |4.1|, (|3.5p and (j3.6f) . for b>1 and 



sin(yw 2 )|e tnU " f(z + zu) du 



'r/y 



|Im/ n (y)| < ze"^*" 

< |y|*e -h fr*> / u 2 e~ tn ^ f(z + zu)du 

< V\ + ~c\y\a 2 n . 

As a result, for n 1 and |y| < (r/M)t n , 



(4.13) |/ n (y)| < 



3y 2 < 



2(1 + 3c) 
On the other hand, for n 3> 1, 

|Jn(y)| <«e- fc ^sup/ 



+ (1 + c)yV4 < 1 



1 + c' 



W\>Jr/y 



< (l + c)AP(\Z\ > y/2t^r/y), 

where Z ~ iV(0, 1) and v4 = sup///(z) < oo. Recall that P(|Z| > x) ~ 
y/2/irx~ l e~ x2 / 2 as x — > oo. Therefore, for A/ » 1/r and |y| < (r/M)t n , 
\J n (y)\ < e"*"^ < (y/i n ) 2 /20 < y 2 a A n /A. 
Combining the bounds for I n (y) and J n (y), 

(4.14) \Mv)\ < i - (j^- c - \) y 2 °n < e~ yM/2 , |y| < (r/M)t B . 

To verify (|4.10|) holds for any 5 > and n = n (£) > 1, by 

/ t 



G„(t„ + it/ VAKtnJ) 



G n (t n ) 



Then, letting y = t/y/A£(t n ), by (gH]) , for n > 1 such that (r/M)t n > 5, 
E[e UY »]l{\t\ < VAHU}| = I0n(y)| n l{|y| < 5} < e-"^/ 2 . 

By (|4.6p . the right hand side is no greater than e~ <2 / 9 , which proves (|4.10p . 
To verify (j4TT]) . fix 5 > and first let A < r/M. Then by ([HID , 

Gn(*n + iy) 



sup 

6<\y\<XU 



G n (t n ) 



sup |(/> n (y)r < e~ 52na ^ 2 . 
6<\y\<\t n 



Since nu^/logn — ► oo, the right hand side is o{l/\fn). On the other hand, 
by gS}, t n y/A^j ~ Thus gUJ holds. 
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Finally, let A > r/ := r/M. From the above proof, it suffices to bound 



sup 

ritn<\y\<^t r , 



G n (t n + iy) 



G n (t r 



sup 

ritn<.\y\<\t r , 



j e -^ +i y> 2 f{z + zu)du 



J e tnU f(z + zu) du 



By change of variable u = xj \J2t n and letting 9 = y/t r 



J e -^+^ u2 f(z + zu)du 
J e _ '"'" 2 /(z + zu) du 



- Wx2 / 2 g n (x)dx, 



where g n (x) 



~ x2/2 f(z + zx/j2Q 



-x 2 /2 



f(z + zx/y/2t n ) dx 



For y E M. with \y\ < Xt n , 9 E [— A/2, A/2]. By the continuity of / at z and 
f(z) > 0, g n (x) — ► e~ x / 2 /\Z2tt pointwise. So by dominated convergence 

/ e- idx2 ' 2 g n ( x )dx^^=L= 

uniformly for 9 E [—A/2, A/2]. Given c > 1, for all n>l, 



- Wx2 / 2 g n {x)dx 



< 



\VT+i9\ (l + fl 2 ) 1 /*- 



It follows that 



sup 

ritn<\y\<^tr, 



G n {t n + iy) 



G n (t r 



-n/4 



< c n 1 + 



By choosing c ~ 1, the right hand side is a n for some a E (0, 1), and hence 
is o(l/(i nV /A£(i n ))). The entire fliTT]) is thus verified. □ 

Appendix. To prove fj2.3|) . let e±, . . . ,e n be the standard basis of M. n . 
Let uq = (1/y/n) Ya=i e *' u i> • • • > n n-i e ^ n be an orthonormal basis. Under 
{uj}, the coordinates of J27=i-^i e i are ^b; • • • ? ^n-l; with Yq = s/nX. 
Then X and Y = (Y±, . . . , Y n -i) have joint density 

n n n—l 

g(t, y) = v^n /( x *)> witn X] 3 ^ = V™ tu o + X! y* - "*' y G M n_1 . 

i=l i=l i=l 

On the other hand, V ~ l^l/v 7 ™ and £ := y/|y| E £ n _i = {x E UT^ 1 : 
|x| = 1} almost surely, where | -| stands for the L 2 -norm. Let v be the uniform 
measure on B n _\. By Y = y/nV^, g(t,y) and the joint density k(t,s,z) of 
(X,V,£) with respect to dtdsv(dz) are related via 

k(t, s, z) 



g(t,y) 



with y 



nsz. 
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Since (f) : z — > to = Yh=i z i u i ^ s an isometric mapping from B n _\ to U n , 
<p*v is the uniform measure on U n . Eq. ()2.3p then follows from 

h(t,s) = J k(t, s, z) v{dz) 

= (v^f-V- 2 J g(t,y/naz)v(dz) 

Jb„-i ,. =1 

Jn i= l 
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